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channels, where Te and Th are tau leptons reconstructed in the electronic and hadronic 
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1 Introduction 

The discovery of the Higgs boson (H) HHSl has generated great interest in exploring its prop¬ 
erties. In the standard model (SM), lepton-flavour-violating (LFV) decays are forbidden if the 
theory is to be renormalizable 01. If this requirement is relaxed, so the theory is valid only 
to a finite mass scale, then LFV couplings may be introduced. LFV decays can occur natu¬ 
rally in models with more than one Higgs doublet without abandoning renormalizability |5|. 
They also arise in supersymmetric models ISHll, composite Higgs boson models lUoUTTl . mod¬ 
els with flavour symmetries Kl^ . Randall-Sundrum models llSHTSl, and many others Ifl6] - l23l . 
The presence of LFV couplings would allow > e, t —> /r and t —> e transitions to proceed via 
a virtual Higgs boson I1M1|25| . The experimental limits on these have recently been translated 
into constraints on the branching fractions jS(H ^ e}i, }iT,eT) 01|26l. The > e transition is 
strongly constrained by null search results for ji ^ ej 127] , f3(H ^ ^e) < 0(10^^). How¬ 
ever, the constraints on t —> and T —> e are much less stringent. These come from searches 
for T —> /^7 |l28l|29l and other rare T decays KSOll . t —> e 7 , and e g — 2 measurements Il27l . 
Exclusion limits on the electron and muon electric dipole moments fSTl also provide comple¬ 
mentary constraints. These lead to the much less restrictive limits: jB(H —> }it) < 0(10%), 
6(H ^ er) < 0(10%). The observation of the Higgs boson offers the possibility of sensitive 
direct searches for LFV Higgs boson decays. To date no dedicated searches have been per¬ 
formed. However, a theoretical reinterpretation of the ATLAS H —> tt search results in terms 
of LFV decays by an independent group has been used to set limits at the 95% confidence level 
(CL) of 6(H ^ ^T) < 13%, 6(H ^ er) < 13% @1. 

This letter describes a search for a LFV decay of a Higgs boson with Mh = 125 GeV at the CMS 
experiment. The 2012 dataset collected at a centre-of-mass energy of \/s = 8 TeV corresponding 
to an integrated luminosity of 19.7 fb^^ is used. The search is performed in two channels, 
H ^ fiTe and H —> f/Th, where Te and Th are tau leptons reconstructed in the electronic and 
hadronic decay channels, respectively. The signature is very similar to the SM H —)■ r^Te and 
H ^ Tj,Th decays, where is a tau lepton decaying muonically, which have been studied by 
CMS in Refs. 13^ I33II and ATLAS in Ref. 1341 , but with some significant kinematic differences. 
The f/ comes promptly from the LFV H decay and tends to have a larger momentum than in 
the SM case. There is only one tau lepton so there are typically fewer neutrinos in the decay. 
They are highly Lorentz boosted and tend to be collinear with the visible r decay products. 

The two channels are divided into categories based on the number of jets in order to separate 
the different H boson production mechanisms. The signal sensitivity is enhanced by using dif¬ 
ferent selection criteria for each category. The dominant production mechanism is gluon-gluon 
fusion but there is also a significant contribution from vector boson fusion which is enhanced 
by requiring jets to be present in the event. The dominant background in the H —> jiTe chan¬ 
nel is Z —> TT. Other much smaller backgrounds come from misidentified leptons in W+jets, 
QCD multijets and tt events. In the H —> channel the dominant background arises from 

misidentified t leptons in W+jets, QCD multijets and tt events. Less significant backgrounds 
come from Z —> tt and Z+jets. The principal backgrounds are estimated using data. There is 
also a small background from SM H decays which is estimated with simulation. The presence 
or absence of a signal is established by fitting a mass distribution for signal and background 
using the asymptotic CLg criterion Il35ll3^ . A "blind" analysis was performed. The data in the 
signal region were not studied until the selection criteria had been fixed and the background 
estimate finalized. 
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3 Event reconstruction 


2 Detector and data sets 

A detailed description of the CMS detector, together with a description of the coordinate sys¬ 
tem used and the relevant kinematic variables, can be found in ref. 113 711 . The momenta of 
charged particles are measured with a silicon pixel and strip tracker that covers the pseudora¬ 
pidity range \rj\ < 2.5 and is inside a 3.8 T axial magnetic field. Surrounding the tracker are a 
lead tungstate crystal electromagnetic calorimeter (ECAL) and a brass/scintillator hadron cal¬ 
orimeter, both consisting of a barrel assembly and two endcaps that extend to a pseudorapidity 
range of It]] < 3.0. A steel/quartz-fiber Cherenkov forward detector extends the calorimetric 
coverage to |7/| < 5.0. The outermost component of the CMS detector is the muon system, 
consisting of gas-ionization detectors placed in the steel flux-return yoke of the magnet to mea¬ 
sure the momenta of muons traversing the detector. The two-level CMS trigger system selects 
events of interest for permanent storage. The first trigger level, composed of custom hard¬ 
ware processors, uses information from the calorimeters and muon detectors to select events in 
less than 3.2 jis. The high-level trigger software algorithms, executed on a farm of commercial 
processors, further reduce the event rate using information from all detector subsystems. 

The H —> /iTh channel selection begins by requiring a single ji trigger with a transverse momen¬ 
tum threshold Pj > 24GeV in the pseudorapidity range \r]\ < 2.1, while the H —)■ ^Te channel 
requires a p-e trigger with pj thresholds of 17GeV {\r]\ < 2.4) for the p and 8GeV {\t]\ < 2.5) 
for the e. Loose e and p identification criteria are applied at the trigger level. The leptons are 
also required to be isolated from other tracks and calorimeter energy deposits to maintain an 
acceptable trigger rate. 

Simulated samples of signal and background events are produced using various Monte Garlo 
(MG) event generators, with the GMS detector response modeled with Geant4 |[38]| . Higgs 
bosons are produced in proton-proton collisions predominantly by gluon-gluon fusion, but 
also by vector boson fusion and in association with a W or Z boson. It is assumed that the 
rate of new decays of the H are sufficiently small that the narrow width approximation can be 
used. The LEV H decay samples are produced with pythia 8.175 1391 . The background event 
samples with a SM H are generated by powheg 1.0 Il40l - l44l with the t decays modelled by 
TAUOLA Ii45l . The MadGraph 5.1 generator is used for Z+jets, W+jets, tt, and diboson 
production, and POWHEG for single top-quark production. The POWHEG and MadGraph 
generators are interfaced with pythia for parton shower and fragmentation. 


3 Event reconstruction 

A particle-flow (PE) algorithm Il47ll48l combines the information from all GMS sub-detectors to 
identify and reconstruct the individual particles emerging from all vertices: charged hadrons, 
neutral hadrons, photons, muons, and electrons. These particles are then used to reconstruct 
jets, hadronic t decays, and to quantify the isolation of leptons and photons. The missing 
transverse energy vector is the negative vector sum of all particle transverse momenta and its 
magnitude is referred to as The variable AR = \/(Arj)^ + used to measure the 

separation between reconstructed objects in the detector, where (p is the azimuthal angle (in 
radians) of the trajectory of the object in the plane transverse to the direction of the proton 
beams. 

The large number of proton interactions occurring per LHG bunch crossing (pileup), with 
an average of 21 in 2012, makes the identification of the vertex corresponding to the hard- 
scattering process nontrivial. This affects most of the object reconstruction algorithms: jets, 
lepton isolation, etc. The tracking system is able to separate collision vertices as close as 0.5 mm 
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along the beam direction Il4^ . For each vertex, the sum of the pj of all tracks associated with 
the vertex is computed. The vertex for which this quantity is the largest is assumed to cor¬ 
respond to the hard-scattering process, and is referred to as the primary vertex in the event 
reconstruction. 

Muons are reconstructed using two algorithms 1501 : one in which tracks in the silicon tracker 
are matched to signals in the muon detectors, and another in which a global track fit is per¬ 
formed, seeded by signals in the muon systems. The muon candidates used in the analysis are 
required to be successfully reconstructed by both algorithms. Further identification criteria are 
imposed on the muon candidates to reduce the fraction of tracks misidentified as muons. These 
include the number of measurements in the tracker and in the muon systems, the fit quality of 
the global muon track and its consistency with the primary vertex. 

Electron reconstruction requires the matching of an energy cluster in the ECAL with a track in 
the silicon tracker 1^1521 . Identification criteria based on the ECAL shower shape, matching 
between the track and the ECAL cluster, and consistency with the primary vertex are imposed. 
Electron identification relies on a multivariate technique that combines observables sensitive 
to the amount of bremsstrahlung along the electron trajectory, the geometrical and momen¬ 
tum matching between the electron trajectory and associated clusters, as well as shower-shape 
observables. Additional requirements are imposed to remove electrons produced by photon 
conversions. 

Jets are reconstructed from all the PF objects using the anti kj jet clustering algorithm 1531 
implemented in FastJet ||54l, with a distance parameter of 0.5. The jet energy is corrected for 
the contribution of particles created in pileup interactions and in the underlying event. Particles 
from different pileup vertices can be clustered into a pileup jet, or significantly overlap a jet 
from the primary vertex below the px threshold applied in the analysis. Such jets are identified 
and removed Il55l . 

Hadronically decaying t leptons are reconstructed and identified using the hadron plus strips 
(HPS) algorithm Il56l which targets the main decay modes by selecting PF candidates with 
one charged hadron and up to two neutral pions, or with three charged hadrons. A photon 
from a neutral-pion decay can convert in the tracker material into an electron and a positron, 
which can then radiate bremsstrahlung photons. These particles give rise to several ECAL 
energy deposits at the same rj value and separated in azimuthal angle, and are reconstructed 
as several photons by the PE algorithm. To increase the acceptance for such converted photons, 
the neutral pions are identified by clustering the reconstructed photons in narrow strips along 
the azimuthal direction. 


4 Event selection 

The event selection consists of three steps. Eirst, a loose selection defining the basic signature 
is applied. The sample is then divided into categories, according to the number of jets in the 
event. Einally requirements are placed on a set of kinematic variables designed to suppress the 
backgrounds. 

The loose selection for the H —> pze channel requires an isolated p (px > 25 GeV, \rj\ < 2.1) and 
an isolated e (px > 10GeV, |p| < 2.3) of opposite charge lying within a region of the detector 
that allows good identification. The e and p are required to be separated by AR > 0.1. The 
H ^ pTh channel requires an isolated p (px > 30GeV, |p| < 2.1) and an isolated hadronically 
decaying t (px > 30 GeV, \t]\ < 2.3) of opposite charge. Leptons are also required to be isolated 
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4 Event selection 


from any jet in the event with pj > 30 GeV by AR > 0.4 and to have an impact parameter 
consistent with the primary vertex. 

The events are then divided into categories within each channel according to the number of 
jets in the event. Jets are required to pass identification criteria 1551 , have px > 30 GeV and 
lie within the range \t]\ < 4.7. The zero jet category contains signal events predominantly 
produced by gluon-gluon fusion. The one-jet category contains signal events predominantly 
produced by gluon-gluon fusion and a negligibly small number of events produced in associ¬ 
ation with a W or Z boson decaying hadronically. The two jet category is enriched with signal 
events produced by vector boson fusion. 

Table 1: Selection criteria for the kinematic variables after the loose selection. 


Variable 


H ^ pTe 


H ^ pTh 

[GeV] 

0-jet 

1-jet 

2-jet 

0-jet 

1-jet 

2-jet 

Pt > 

50 

45 

25 

45 

35 

30 

Pt > 

10 

10 

10 

— 

— 

— 

Pt > 

— 

— 

— 

35 

40 

40 

Mf < 

65 

65 

25 

— 

— 

— 

> 

50 

40 

15 

— 

— 

— 

< 

— 

— 

— 

50 

35 

35 

[radians] 


— 

— 

— 

2.7 

— 

— 

^4’pe-E^iss < 

0.5 

0.5 

0.3 

— 

— 

— 


2.7 

1.0 

— 

— 

— 

— 


The main variable for the discrimination between the signal and background is the collinear 
mass, Mcoi/ which provides an estimator of the reconstructed H mass using the observed de¬ 
cay products. This is constructed using the collinear approximation Il57l which is based on the 
observation that since the mass of the H is much greater than the mass of the t, the t decay 
products are highly Lorentz boosted in the direction of the t. The neutrino momenta can be 
approximated to be in the same direction as the other visible decay products of the t and the 
component of the missing transverse energy in the transverse direction of the visible T decay 
products is used to estimate the transverse component of the neutrino momentum. Figure 
shows Mcoi distribution for the signal and background compared to data for each of the cate¬ 
gories in each channel after the loose selection. The simulated signal for S(H —> pr) = 100% is 
shown. The principal backgrounds are estimated with data using techniques described in Sec¬ 
tion]^ There is good agreement between data and the background estimation. The agreement 
is similar in all of the kinematic variables that are subsequently used to suppress backgrounds. 
The analysis is performed "blinded" in the region 100 < M^oi < 150 GeV. 

Next, a set of kinematic variables is defined and the criteria for selection are determined by 
optimizing for S/ VS -h B where S and B are the expected signal and background event yields 
in the mass window 100 < Mj-oi < 150 GeV. The signal event yield corresponds to the SM H 
production cross section at Mh = 125 GeV with S(H —>• pr) = 10%. This value for the LFV H 
branching fraction is chosen because it corresponds to the limit from indirect measurements as 
described in Ref. ["4J. The optimization was also performed assuming S(H —>• pr) = 1% and 
negligible change in the optimal values of selection criteria was observed. The criteria for each 
category, and in each channel, are given in Table The variables used are the lepton transverse 

momenta pj with £ = Th, p, e; azimuthal angles between the leptons _ j >^; azimuthal angle 

_ Pt ~Pt 

A(ppt __gmiss', the transverse mass Mj = ■\/2pj£^^®®(l — cos Events in the 2-jet cate- 
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Figure 1: Distributions of the collinear mass M^oi for signal with S(H —> }it) = 100% for 
clarity, and background processes after the loose selection requirements for the LFV H —> f/r 
candidates for the different channels and categories compared to data. The shaded grey bands 
indicate the total uncertainty. The bottom panel in each plot shows the fractional difference 
between the observed data and the total estimated background. Top left: H —> }iTe 0-jet; top 
right: H —> 0-jet; middle left: H —> fiTg 1-jet; middle right: H —> f/Th 1-jet; bottom left: 
H jiTe 2-jet; bottom right H —> f/Th 2-jet. 
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5 Background Processes 


gory are required to have two jets separated by a pseudorapidity gap (| A//1 >3.5) and to have a 
dijet invariant mass greater than 550 GeV. In the H —)■ /rze channel events in which at least one 
of the jets identified as coming from a b-quark decay are using the combined secondary-vertex 
b-tagging algorithm 11581 are vetoed, to suppress backgrounds from top quark decays. 

5 Background Processes 

The contributions of the dominant background processes are estimated with data while less 
significant backgrounds are estimated using simulation. The largest backgrounds come from 
Z —)■ TT and from misidentified leptons in W+jets and QCD multijet production. 

5.1 Z —TT 

The Z —> TT background contribution is estimated using an embedding technique Il33l l59l as 
follows. A sample of Z —> events is taken from data using a loose }i selection. The two 
muons are then replaced with PF particles resulting from the reconstruction of simulated r 
lepton decays. Thus, the key features of the event topology such as the jets, missing trans¬ 
verse energy and underlying event are taken directly from data with only the T decays being 
simulated. The normalization of the sample is obtained from the simulation. The technique 
is validated by comparing the t lepton identification efficiencies estimated with an embedded 
decay sample, using simulated Z ^ events, to those from simulated Z —> tt decays. 

5.2 Misidentified ieptons 

Leptons can arise from misidentified PF objects in W+jets and QCD multijet processes. This 
background is estimated with data. A sample with similar kinematic properties to the sig¬ 
nal sample but enriched in W+jets and QCD multijets is defined. Then the probability for PF 
objects to be misidentified as leptons is measured in an independent data set, and this prob¬ 
ability is applied to the enriched sample to compute the misidentified lepton background in 
the signal region. The technique is shown schematically in Table in which four regions are 
defined including the signal and background enriched regions and two control regions used 
for validation of the technique. It is employed slightly differently in the H —> jiTg and H —> 
channels. The lepton isolation requirements used to define the enriched regions in each channel 
are slightly different. 

In the H —> ^Te channel, region I is the signal region in which an isolated and an isolated e 
are required. Region III is a data sample in which all the analysis selection criteria are applied 
except that one of the leptons is required to be not-isolated. Thus, there are two components: 
events with an isolated ji and not-isolated e events, as well as events with an isolated e and 
not-isolated }i events. There is negligible number of signal events in region III. Regions II and 
IV are data samples formed with the same selection criteria as regions I and III, respectively, but 
with same-sign rather than opposite-sign leptons. The kinematic distributions of the same-sign 
samples are very similar to the opposite-sign samples 

The sample in region III is dominated by W+jets and QCD multijets but with small contribu¬ 
tions from WW, ZZ and WZ that are subtracted using simulation. The misidentified back¬ 
ground in region I is then estimated by multiplying the event yield in region III by a factor 
fji ■ ^trigger/ where ffi is the ratio of not-isolated to isolated }i's. It is computed in an indepen¬ 
dent data sample Z ^ fiji + X, where X is an object identified as a }i, in bins of pj and tj. The 
Z ^ pp + X sample is corrected for contributions from WW, ZZ and WZ using simulated sam¬ 
ples. A correction ^trigger is made to account for the difference in trigger efficiency for selection 



5.2 Misidentified leptons 
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Table 2: Schematic to illustrate the application of the method used to estimate the misidenti¬ 
fied lepton (i) background. Samples are defined by the charge of the two leptons and by the 
isolation requirements on each. Charged conjugates are assumed. 


Region I 

Region II 

(isolated) 

£^ (isolated) 

^2 (isolated) 

£2 (isolated) 

Region III 

Region IV 

(isolated) 

£^ (isolated) 

^^(not-isolated ) 

£2 (not-isolated) 


of events with isolated e and not-isolated ji versus the events with isolated e and isolated 
The misidentified e background is computed in exactly the same way. The technique is vali¬ 
dated by using the same-sign data from regions II and IV as shown schematically in Table In 
Fig. I^left) the observed data yield in region II is compared to the estimate from scaling the re¬ 
gion IV sample by the measured misidentification rates. The region II sample is dominated by 
misidentified leptons but also includes small contributions of true leptons arising from vector 
boson decays, estimated with simulated samples. 


19.7 fb ' (8 TeV) 


Bkgd. stat. uncertainty 
Bkgd. syst. uncertainty 
Misidentified leptons 
True leptons 
LFV GG Higgs (B=100%) 
LFV VBF Higgs (B=100%) 



200 

M(ht 


300 

, [GeV] 


> 

0 ) 

O 


CMS 


$ 10 ^ - 


19.7fb'’ {8 TeV) 
Data, |iT^ ^ ^ ^ ^ 

Bkgd. stat. uncertainty 
Bkgd. syst. uncertainty E 
Misidentified z 
LFV GG Higgs (B=100%) 
LFV VBF Higgs (B=100%) _ 



200 300 

[GeV] 


Figure 2: Distributions of for region II compared to the estimate from scaling the region 
IV sample by the measured misidentification rates. The bottom panel in each plot shows the 
fractional difference between the observed data and the estimate. Left: H —)■ jiTg. Right: H —> 
F^h- 


In the H —)■ jiZh channel, the Th candidate can come from a misidentified jet with a number of 
sources, predominantly W-|-jets and QCD multijets, but also Z —> ^/^+jets and tf. In this case 
the enriched background regions are defined with t;, candidates that pass a looser isolation 
requirement, but do not pass the signal isolation requirement. The misidentification rate is 
then defined as the fraction of candidates with the looser isolation that also pass the signal 
isolation requirement. It is measured in observed Z —> -|- X events, where X is an object 

identified as a Th- The misidentification rate measured in Z —> fiji + X data is checked by 
comparing to that measured in Z ^ jifi + X simulation and found to be in good agreement. 
The misidentified background in the signal region (region I) is estimated by multiplying the 
event yield in region III by a factor /Th/(1 “ /tk)- The procedure is validated with same-sign 
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6 Systematic uncertainties 


Table 3: Systematic uncertainties in the expected event yield in %. All uncertainties are treated 
as correlated between the categories, except where there are two numbers. In this case the 
number denoted with * is treated as uncorrelated between categories and the total uncertainty 
is the sum in quadrature of the two numbers. 


Systematic uncertainty 


H ^ 

jiTe 

H ^ 

0-Jet 

1-Jet 

2-Jets 

0-Jet 

1-Jet 

2-Jets 

electron trigger/ID/isolation 

3 

3 

3 

— 

— 

— 

muon trigger/ID/isolation 

2 

2 

2 

2 

2 

2 

hadronic tau efficiency 

— 

— 

— 

9 

9 

9 

luminosity 

2.6 

2.6 

2.6 

2.6 

2.6 

2.6 

Z ^ TT background 

3+3=^ 

3+5=^ 

3+10=^ 

3+5=^ 

3+5* 

3+10* 

Z —)■ fi}i,ee background 

30 

30 

30 

30 

30 

30 

misidentified fi, e background 

40 

40 

40 

— 

— 

— 

misidentified Th background 

— 

— 

— 

30+10* 

30 

30 

WW, ZZ+jets background 

15 

15 

15 

15 

15 

65 

tt background 

10 

10 

10+10=^ 

10 

10 

10+33* 

W -H 7 background 

100 

100 

100 

— 

— 

— 

b-tagging veto 

3 

3 

3 

— 

— 

— 

single top production background 

10 

10 

10 

10 

10 

10 


}iT events in the same way as for the H —>• jiTe channel above. Figure [fright) shows the data in 
region II compared to the estimate from scaling region IV by the misidentification rates. 

The method assumes that the misidentification rate in Z —> + X events is the same as for 
W+jets and QCD processes. To test this assumption the misidentification rates are measured 
in a QCD jet data control sample. They are found to be consistent. Finally as a cross-check 
the study has been performed also as a function of the number of jets in the event and similar 
agreement is found. 

5.3 Other backgrounds 

The SM H decays in the H —> tt channel provide a small background that is estimated with 
simulation. This background is suppressed by the kinematic selection criteria and peaks be¬ 
low 125 GeV. The W leptonic decay from tf produces opposite-sign dileptons and This 

background is estimated with simulated tt events using the shape of the M^oi distribution from 
simulation and a data control region for normalization. The control region is the 2-jet selection 
but with the additional requirement that at least one of the jets is b-tagged in order to enhance 
the tf contribution. Other smaller backgrounds come from WW, ZZ+jets, Wy+jets and single 
top-quark production. Each of these is estimated with simulation. 


6 Systematic uncertainties 

To set upper bounds on the signal strength, or determine a signal significance, we use the CLg 
method Il35ll3^ . A binned likelihood is used, based on the distributions of Mcoi for the signal 
and the various background sources. Systematic uncertainties are represented by nuisance 
parameters, some of which only affect the background and signal normalizations, while others 
affect the shape and/or normalization of the Mcoi distributions. 








6.1 Normalization uncertainties 
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6.1 Normalization uncertainties 

The uncertainties are summarized in Tables]^ andThe uncertainties in the e and ^ selection 
efficiency (trigger, identification and isolation) are estimated using the "tag and probe" tech¬ 
nique in Z —> ee, }i}i data l59l- The identification efficiency of hadronic t decays is estimated 
using the "tag and probe" technique in Z ^ tt data Il56l . The uncertainty in the Z —> rr 
background comes predominantly from the uncertainty in the t efficiency The uncertainties 
in the estimation of the misidentified lepton rate come from the difference in rates measured 
in different data samples (QCD multijets and W+jets). The uncertainty in the production cross 
section of the backgrounds that have been estimated by simulation is also included. 

There are several uncertainties on the H production cross section, which depend on the pro¬ 
duction mechanism contribution and the analysis category They are given in Table These 
affect the LFV H and the SM H background equally, and are treated as 100% correlated. The 
parton distribution function (PDF) uncertainty is evaluated by comparing the yields in each 
category, when spanning the parameter range of a number of different independent PDF sets 
including CTIO t60l , MSTW |M]/ NNPDF as recommended by PDF4LHC Il63]l . The scale 
uncertainty is estimated by varying the renormalization, /tr, and factorization scales, }if, up 
and down by one half or two times the nominal scale {fiR = }ip = Mh/2) under the constraint 
0.5 < }ip/}iR < 2 |64l. The underlying event and parton shower uncertainty is estimated by 
using two different PYTHIA tunes, AUET2 and Z2*. Anticorrelations arise due to migration of 
events between the categories and are expressed as negative numbers. 

Table 4: Theoretical uncertainties in % for Higgs boson production. Anticorrelations arise due 
to migration of events between the categories and are expressed as negative numbers. 


Systematic uncertainty 

Gluon-Gluon Fusion 

Vector Boson Fusion 

0-Jets 

1-Jets 

2-Jets 

0-Jet 

1-Jet 

2-Jets 

parton distribution function 

+9.7 

+9.7 

+9.7 

+3.6 

+3.6 

+3.6 

renormalization/factorization scale 

+8 

+10 

-30 

+4 

+1.5 

+2 

underlying event/parton shower 

+4 

-5 

-10 

+10 

<1 

-1 


6.2 Mcoi shape uncertainties 

The systematic uncertainties that lead to a change in the shape of the Mcoi distribution are 
summarized in Table In the embedded Z ^ rr Mcoi distribution, used to estimate the 

Table 5: Systematic uncertainties in % for the shape of the signal and background templates. 


Systematic uncertainty 

H —> f/Te 

H ^ jiTh 

hadronic tau energy scale 

— 

3 

jet energy scale 

3-7 

3-7 

unclustered energy scale 

10 

10 

Z —> TT bias 

100 

— 


Z —> TT background, a 1% shift has been observed with respect to Z —> tt simulations by 
comparing the means of both distributions. This occurs only in the H —)■ jiTe channel. The Mcoi 
distribution has been corrected for this effect and a 100 % uncertainty on this shift is used as a 
systematic uncertainty for the possible bias. The jet energy scale has been studied extensively 
and a standard prescription for corrections Il65l is used in all CMS analyses. The overall scale 
is set using 7 +jets events and the most significant uncertainty arises from the photon energy 
scale. A number of other uncertainties such as jet fragmentation modeling, single pion response 
and uncertainties in the pileup corrections are also included. The jet energy scale uncertainties 
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7 Results 


(3-7%) are applied as a function of px and t], including all correlations, to all jets in the event, 
propagated to the missing energy, and the resultant Mcoi distribution is used in the fit. There 
is also an additional uncertainty to account for the unclustered energy scale uncertainty The 
unclustered energy comes from jets below 10 GeV and PF candidates not within jets. It is also 
propagated to the missing transverse energy. These effects cause a shift of the Mcoi distribution. 
The T energy scale is estimated by comparing Z —> tt events in data and simulation. An 
uncertainty of 3% is derived from this comparison. The uncertainty is applied by shifting the 
px of the T candidates in the event and using the resultant M^oi distribution in the fit. Finally, 
the Mcoi distributions used in the fit have a statistical uncertainty in each mass bin that is 
included as an uncertainty which is uncorrelated between the bins. 

Potential uncertainties in the shape of the misidentified lepton backgrounds have also been 
considered. In the H —> pze channel the misidentified lepton rates are measured and 
applied in bins of lepton px and p. These rates are all adjusted up or down by one standard 
deviation (cr) and the differences in the shape of the resultant Mcoi distributions are then used as 
nuisance parameters in the fit. In the H —> channel the t misidentification rate fr is found 

to be approximately flat in px and p. To estimate the systematic uncertainty the px distribution 
of fr is fit with a linear function and the rate recomputed from the fitted slope and intercept. 
The modified M^oi distribution that results from the recomputed background is then used to 
evaluate the systematic uncertainty. 


7 Results 

The Mcoi distributions after the fit for signal and background contributions are shown in Fig. 
and the event yields in the mass range 100 < Mcoi < 150 GeV are shown in Table The 
different channels and categories are combined to set a 95% GL upper limit on the branching 
fraction of LFV H decay in the pr channel, i3(H —)■ pr). 

Table 6: Event yields in the signal region, 100 < M^oi < 150 GeV after fitting for signal 
and background. The expected contributions are normalized to an integrated luminosity of 
19.7fb~^. The LFV Higgs boson signal is the expected yield for B(H —>• pr) = 0.84% with the 
SM Higgs boson cross section. 


Sample 


H ^ pTh 



H ^ pTe 


0-Jet 

1-Jet 

2-Jets 

0-Jet 

1-Jet 

2-Jets 

misidentified leptons 

1770 ± 530 

377 ±114 

1.8 ± 1.0 

42 ±17 

16 ±7 

1.1 ±0.7 

Z —> TT 

187 ±10 

59 ±4 

0.4 ± 0.2 

65 ±3 

39 ±2 

1.3 ±0.2 

zz,ww 

46 ±8 

15 ±3 

0.2 ± 0.2 

41 ±7 

22 ±4 

0.7 ±0.2 

Wy 

— 

— 

— 

2±2 

2±2 

— 

Z —> ee or pp 

110 ±23 

20 ±7 

0.1 ±0.1 

1.6 ±0.7 

1.8 ±0.8 

— 

tf 

2.2 ±0.6 

24 ±3 

0.9 ± 0.5 

4.8 ±0.7 

30 ±3 

1.8 ±0.4 

tf 

2.2 ±1.1 

13 ±3 

0.5 ± 0.5 

1.9 ±0.2 

6.8 ±0.8 

0.2 ±0.1 

SM H background 

7.1 ±1.3 

5.3 ±0.8 

1.6 ±0.5 

1.9 ±0.3 

1.6 ±0.2 

0.6 ±0.1 

sum of backgrounds 

2125 ± 530 

513 ±114 

5.4 ± 1.4 

160 ± 19 

118 ±9 

5.6 ±0.9 

LFV Higgs boson signal 

66 ±18 

30 ±8 

2.9 ± 1.1 

23 ±6 

13 ±3 

1.2 ±0.3 

data 

2147 

511 

10 

180 

128 

6 


The observed and the median expected 95% GL upper limits on the 6(H —> pr) for the 
H mass at 125 GeV are given for each category in Table Gombining all the channels, an 
expected upper limit of 6(H —> pr) < (0.75 ± 0.38)% is obtained. The observed upper 
limit is 6(H —> pr) < 1.51% which is above the expected limit due to an excess of the ob- 
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Figure 3: Distributions of the collinear mass Mcoi after fitting for signal and background for the 
LFV H —> /TT candidates in the different channels and categories compared to data. The distri¬ 
bution of the simulated LFV Higgs boson sample is shown for the best fit branching fraction of 
S(H —)• /tt) = 0.84%. The bottom panel in each plot shows the fractional difference between 
the observed data and the fitted background. Top left: H —> /TTe 0-jet; top right: H —> /tt^ 0-jet; 
middle left: H —)■ /TTe 1-jet; middle right: H —> /tt^ 1-jet; bottom left: H —> /TTe 2-jet; bottom 
right H —> /TTh 2-jet. 
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8 Limits on iepton-fiavour-vioiating coupiings 


served number of events above the background prediction. The fit can then be used to es¬ 
timate the branching fraction if this excess were to be interpreted as a signal. The best fit 
values for the branching fractions are given in Table The limits and best fit branching frac¬ 
tions are also summarized graphically in Fig. The combined categories give a best fit of 
S(H —> pir) = ( 0 . 84 ^ 037 )%. The combined excess is 2.4 standard deviations which corre¬ 
sponds to a p-value of 0.010 at Mh = 125 GeV. The observed and expected M^oi distributions 
combined for all channels and categories are shown in Fig. The distributions are weighted 
in each channel and category by the S/ (S -|- B) ratio, where S and B are respectively the signal 
and background yields corresponding to the result of the global fit. The values for S and B are 
obtained in the 100 < Mcoi < 150 GeV region. 

Table 7: The expected upper limits, observed upper limits and best fit values for the branch¬ 
ing fractions for different jet categories for the H —> process. The one standard-deviation 
probability intervals around the expected limits are shown in parentheses. 


Expected Limits 



0-Jet 

1-Jet 

2-Jets 


(%) 

(%) 

(%) 

JiXe 

<1.32 (±0.67) 

<1.66 (±0.85) 

<3.77 (±1.92) 

HTh 

<2.34 (±1.19) 

<2.07 (±1.06) 

<2.31 (±1.18) 

f/T 

<0.75 (±0.38) 

Observed Limits 

}lTe 

<2.04 

<2.38 

<3.84 

HTh 

<2.61 

< 2.22 

<3.68 

JiT 

<1.51 

Best Fit Branching Fractions 

}lTe 

BTh 

0 87+0'66 
-0.62 

0 41 + 1-20 

0 81 

0 21+1-03 

0 05+1-33 

U.UO_0.97 

1 48+1-16 
i-+°-0.93 

}IT 

0.84l«;f7 


8 Limits on lepton-flavour-violating couplings 

The constraint on 13(H —)■ jir) can be interpreted in terms of LFV Yukawa couplings 01. The 
LFV decays H —)■ eji, ex, jix arise at tree level from the assumed flavour-violating Yukawa 
interactions, Y^a^/s where denote the leptons, = e,}i,T and 7 ^ The decay width 
F(H — 7 in terms of the Yukawa couplings is given by: 


and the branching fraction by: 


B(H ^ = 


F(H ^ 


F(H 


+ hsM 


The SM H decay width is assumed to be Fsm = 4.1 MeV Il 66 l for Mh = 125 GeV. The 95% GL 
constraint on the Yukawa couplings derived from S(H —> jix) < 1.51% and the expression for 
the branching fraction above is: 


y^|Y,„|2+|Y^^|2 < 3.6 X 10-3. 

Figure [^compares this result to the constraints from previous indirect measurements. 
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Figure 4: Left: 95% CL Upper limits by category for the LFV H —> /jt decays. Right: best fit 
branching fractions by category. 
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Figure 5: Left: Distribution of Mcoi for all categories combined, with each category weighted 
by significance (S/(S + B)). The significance is computed for the integral of the bins in the 
range 100 < Mcoi < 150 GeV using B{¥i ^ }it) = 0.84%. The simulated Higgs signal shown 
is for 13(H —>• jir) = 0.84%. The bottom panel shows the fractional difference between the 
observed data and the fitted background. Right: background subtracted Med distribution for 
all categories combined. 
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9 Summary 



Figure 6 : Constraints on the flavour-violating Yukawa couplings, |Y^t| and |Yt^|. The black 
dashed lines are contours of i3(H —> for reference. The expected limit (red solid line) 
with one sigma (green) and two sigma (yellow) bands, and observed limit (black solid line) 
are derived from the limit on i3(H —)■ ^T) from the present analysis. The shaded regions are 
derived constraints from null searches for t — )■ 3/r (dark green) and t —> f /7 (lighter green). The 
yellow line is the limit from a theoretical reinterpretation of an ATLAS H —> tt search |4|. The 
light blue region indicates the additional parameter space excluded by our result. The purple 
diagonal line is the theoretical naturalness limit < mirrij/v^. 


9 Summary 


The first direct search for lepton-flavour-violating decays of a Higgs boson to a fi-T pair, based 
on the full 8 TeV data set collected by CMS in 2012 is presented. It improves upon previously 
published indirect limits (H |23 by an order of magnitude. A slight excess of events with a 
significance of 2.4 a is observed, corresponding to a p-value of 0.010. The best fit branching 
fraction is 5(H — > }it) = (0.84]'^q 37)%. A constraint of 5(H — > }it) < 1.51% at 95% confidence 
level is set. The limit is used to constrain the Yukawa couplings, VlY^^P + < 3.6 x 10^^. 

It improves the current bound by an order of magnitude. 
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